The Dockstore: enabling modular, community-focused sharing

ثبت نشده
چکیده

As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Translational Genomics, are using cloud-based infrastructure to both host and perform analysis across large data sets. In PCAWG, over 5,800 whole human genomes were aligned and variant called across 14 cloud and HPC environments; the processed data was then made available on the cloud for further analysis and sharing. If run locally, an operation at this scale would have monopolized a typical academic data centre for many months, and would have presented major challenges for data storage and distribution. However, this scale is increasingly typical for genomics projects and necessitates a rethink of how analytical tools are packaged and moved to the data. For PCAWG, we embraced the use of highly portable Docker images for encapsulating and sharing complex alignment and variant calling workflows across highly variable environments. While successful, this endeavor revealed a limitation in Docker containers, namely the lack of a standardized way to describe and execute the tools encapsulated inside the container. As a result, we created the Dockstore ( ), a project that brings together Docker images with https://dockstore.org standardized, machine-readable ways of describing and running the tools contained within. This service greatly improves the sharing and reuse of genomics tools and promotes interoperability with similar projects through emerging web service standards developed by the Global Alliance for Genomics and Health (GA4GH). This article is included in the Container channel. Virtualization in Bioinformatics 1 2 2 2

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows

As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Tran...

متن کامل

Presentation of a Causal Model of Campus Community Quality, Knowledge Sharing and Academic Performance among Students of Shiraz University of Medical Sciences

Introduction: As an intellectual and ethical environment, university campus increases motivation and enhances constructive interactions among students by creating and disseminating knowledge and leads to emergence of talents. The purpose of this study was to examine the relationship among university Campus community quality, knowledge sharing and students’ academic performance. Methods: This d...

متن کامل

Dialogue Systems and Planning

Planning processes are often used in dialogue systems to recognize the intentions conveyed in dialogue. The generation of utterances can also be achieved by a planning/execution mechanism. Some advantages of this kind of mechanism are: knowledge sharing, modular design, declarative description, etc. In this paper, we present some planning mechanisms and the related models enabling the dialogue ...

متن کامل

A Modular On-line Profit Sharing Approach in Multiagent Domains

How to coordinate the behaviors of the agents through learning is a challenging problem within multi-agent domains. Because of its complexity, recent work has focused on how coordinated strategies can be learned. Here we are interested in using reinforcement learning techniques to learn the coordinated actions of a group of agents, without requiring explicit communication among them. However, t...

متن کامل

The Role of Community Trust and Altruism in Knowledge Sharing: An Investigation of a Virtual Community of Teacher Professionals

The knowledge sharing process within a virtual community of teacher professionals is viewed as a social exchange process in that the knowledge sharing intention and behavior of individuals are influenced by the exchange relationship among members. However, relatively little research has focused on this approach to exploring the factors that enhance the knowledge sharing of individuals. This stu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017